Document Clustering for Social Problem Detection and Cluster Evaluation Measures
Similar resources
Association Coefficient Measures for Document Clustering
This paper presents association coefficient measures for document clustering. The proposed approach is based on intuitionistic fuzzy sets and uses twelve association coefficient measures, f1 to f12. In document clustering, the steps of document collection, text pre-processing, feature selection, indexing, clustering process, and results analysis are us...
Candidate Cluster Extraction for Hierarchical Document Clustering
The number of text documents on the internet is increasing tremendously, and hierarchical document clustering has proven useful for grouping similar documents in large applications. Still, most documents suffer from problems of high dimensionality, scalability, accuracy, and meaningful cluster labels. In this paper, a new approach, fuzzy frequent itemset-based hierarchical clustering, is proposed, in which...
Similarity Measures for Text Document Clustering
Clustering is a useful technique that organizes a large quantity of unordered text documents into a small number of meaningful and coherent clusters, thereby providing a basis for intuitive and informative navigation and browsing mechanisms. Partitional clustering algorithms have been recognized to be more suitable as opposed to the hierarchical clustering schemes for processing large datasets....
Optimum Cluster Labeling and Document Clustering for Forensic Analysis
Document clustering, or unsupervised document classification, is an automated process of grouping documents with similar content. It is an important task in many information retrieval systems. Document clustering algorithms can also help discover new and useful knowledge or novel classes in the documents under analysis. This knowledge or novel class is a very important issue...
Challenging Issues and Similarity Measures for Web Document Clustering
The Web contains a large number of documents in electronic form. These documents come in various formats, and the information in them is not organized. The lack of organization of materials on the WWW motivates people to manage the huge amount of information automatically. Text mining refers generally to the process of extracting interesting and non-trivial information a...
Journal
Journal title: Transactions of the Japanese Society for Artificial Intelligence
Year: 2009
ISSN: 1346-0714,1346-8030
DOI: 10.1527/tjsai.24.333